Confounding from Cryptic Relatedness in Case-Control Association Studies

نویسندگان

  • Benjamin F Voight
  • Jonathan K Pritchard
چکیده

Case-control association studies are widely used in the search for genetic variants that contribute to human diseases. It has long been known that such studies may suffer from high rates of false positives if there is unrecognized population structure. It is perhaps less widely appreciated that so-called "cryptic relatedness" (i.e., kinship among the cases or controls that is not known to the investigator) might also potentially inflate the false positive rate. Until now there has been little work to assess how serious this problem is likely to be in practice. In this paper, we develop a formal model of cryptic relatedness, and study its impact on association studies. We provide simple expressions that predict the extent of confounding due to cryptic relatedness. Surprisingly, these expressions are functions of directly observable parameters. Our analytical results show that, for well-designed studies in outbred populations, the degree of confounding due to cryptic relatedness will usually be negligible. However, in contrast, studies where there is a sampling bias toward collecting relatives may indeed suffer from excessive rates of false positives. Furthermore, cryptic relatedness may be a serious concern in founder populations that have grown rapidly and recently from a small size. As an example, we analyze the impact of excess relatedness among cases for six phenotypes measured in the Hutterite population.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Population Structure in Genetic Association Studies

Standard genetic association tests using case-control data are based on certain assumptions about the population from which study subjects were sampled. Two types of departure from these assumptions have been studied: population stratification and cryptic relatedness. Both types of departure have been called population structure. Each can lead to erroneous inferences due to differences between ...

متن کامل

Correcting for cryptic relatedness in population-based association studies of continuous traits.

Cryptic relatedness was suggested to be an important source of confounding in population-based association studies (PBAS). The magnitude and manner of cryptic relatedness affecting the performance of PBAS of continuous traits remain to be investigated. We simulated a set of related samples through biased sampling and inbreeding, and evaluated the power and type I error rates of simple associati...

متن کامل

Population Structure and Cryptic Relatedness in Genetic Association Studies

We review the problem of confounding in genetic association studies, which arises principally because of population structure and cryptic relatedness. Many treatments of the problem consider only a simple “island” model of population structure. We take a broader approach, which views population structure and cryptic relatedness as different aspects of a single confounder: the unobserved pedigre...

متن کامل

The confounding effect of cryptic relatedness for environmental risks of systolic blood pressure on cohort studies

The impact of cryptic relatedness (CR) on genomic association studies is well studied and known to inflate false-positive rates as reported by several groups. In contrast, conventional epidemiological studies for environmental risks, the confounding effect of CR is still uninvestigated. In this study, we investigated the confounding effect of unadjusted CR among a rural cohort in the relationsh...

متن کامل

An analytical comparison of the principal component method and the mixed effects model for association studies in the presence of cryptic relatedness and population stratification.

The principal component method and the mixed effects model represent two popular approaches to controlling for population structure and cryptic relatedness in genetic association studies. There are only a handful of studies comparing their performance. These studies are typically based on simulation studies and the results are therefore limited in their applicability. In this paper, we conduct ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • PLoS Genetics

دوره 1  شماره 

صفحات  -

تاریخ انتشار 2005